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^ Abstract 

(N . . . ^ 

fSJ Motivated by the desire to cope with data imprecision 31 , we study methods for taking advantage of 

preliminary information about point sets in order to speed up the computation of certain structures associated 
with them. 

In particular, we study the following problem: given a set L of n lines in the plane, we wish to preprocess 
L such that later, upon receiving a set P of n points, each of which lies on a distinct line of L, we can construct 
the convex hull of P efficiently. We show that in quadratic time and space it is possible to construct a data 
I ^ I structure on L that enables us to compute the convex hull of any such point set P in 0{na(n) log* n) expected 

time. If we further assume that the points are "oblivious" with respect to the data structure, the running 
time improves to 0{na{n)). The same result holds when L is a set of line segments (in general position). 
We present several extensions, including a trade-off between space and query time and an output-sensitive 
algorithm. We also study the "dual problem" where we show how to efficiently compute the (< A;)-level of 
n lines in the plane, each of which is incident to a distinct point (given in advance). 

We complement our results by Q{n\ogn) lower bounds under the algebraic computation tree model for 
several related problems, including sorting a set of points (according to, say, their a;-order), each of which 
lies on a given line known in advance. Therefore, the convex hull problem under our setting is easier than 
sorting, contrary to the "standard" convex hull and sorting problems, in which the two problems require 
Q{n\ogn) steps in the worst case (under the algebraic computation tree model). 



Keywords: data imprecision, convex hull, planar arrangements, geometric data structures, randomized 
constructions 



1 Introduction 

Most studies in computational geometry rely on an unspoken assumption: whenever we are given 
a set of input points, their precise locations are available to us. Nowadays, however, the input is 
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after Preprocessing in Proc. 27th SoCG, pp. 11-20, 2011 
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often obtained via sensors from the real world, and hence it comes with an inherent imprecision. 
Accordingly, an increasing effort is being devoted to achieving a better understanding of data 



imprecision and to developing tools to cope with it (see, e.g., 31 and the references therein). The 
notion of imprecise data can be formalized in numerous ways |24 ^1 , 33 . We consider a particular 
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30 



32 , 34 . We are given a set of 



setting that has recently attracted considerable attention [8,20 
planar regions, each of which represents an estimate about an input point, and the exact coordinates 
of the points arrive some time later and need to be processed quickly. This situation could occur, 
e.g., during a two-phase measuring process: first the sensors quickly obtain a rough estimate of 
the data, and then they invest considerably more time to find the precise locations. This raises 
the necessity to preprocess the preliminary (imprecise) locations of the points, and store them in 
an appropriate data structure, so that when the exact measurements of the points arrive we can 
efficiently compute a pre-specified structure on them. In settings of this kind, we assume that for 
each input point its corresponding region is known (note that by this assumption we also avoid a 
point-location overhead). In light of the applications, this is a reasonable assumption, and it can 
be implemented by, e.g., encoding this information in the ordering of P. 

Related work 

Data imprecision. Previous work has mainly focused on computing a triangulation for the 
input points. Held and Mitchell [25] were the first to consider this framework, and they obtained 
optimal bounds for preprocessing disjoint unit disks for point set triangulations, a result that was 
later generalized by van Kreveld et al. i30i to arbitrary disjoint polygonal regions. For Delaunay 
triangulations, Loffler and Snoeyink [34| obtained an optimal result for disjoint unit disks (see 
also 20 32 ), which was later simplified and generalized by Buchin et al. |8| to /ai[^ and possibly 
intersecting regions. If n is the number of input regions, the preprocessing phase typically takes 
0(n log n) time and yields a linear size data structure; the time to find the structure on the exact 
points is usually linear or depends on the complexity (and the fatness) of the input regions. 

Since the convex hull can be easily extracted from the Delaunay triangulation in linear time, 
the same bounds carry over. However, once the regions are not necessarily fat, the techniques 
in f8ll34| do not yield the aforementioned bounds anymore. In particular, if the regions consist 
of lines or line segments, one cannot hope (under certain computational models) to construct the 
Delaunay triangulation of P in time o(n log n), regardless of preprocessing (see [22| and Section |4|. 
Nevertheless, if we are less ambitious and just wish to compute the convex hull of P, we can achieve 
better performance, as our main result shows. 

Convex hull. Computing the convex hull of a planar n-point set is perhaps the most fundamental 
problem in computational geometry, and there are many algorithms available [6 37 . All these 



algorithms require 0(nlogn) steps, which is optimal in the algebraic computation tree model js]. 
However, there are numerous ways to exploit additional information to improve this bound. For 
example, if the points are sorted along any fixed direction, Graham's scan takes only linear time [6]. 
If we know that there are only h points on the hull, the running time reduces to 0(n log /i) 111 



28 . If the points constitute the vertices of a given polygonal chain, the complexity again reduces 
to linear [36| . Our work shows another setting in which additional information can be used to 
circumvent the theoretic lower bound. 



^ A planar region o is said to be fat if there exist two concentric disks, -D C o C D', such that the ratio between 
the radii of D' and D is bounded by some constant. 
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Another somewhat related problem (albeit conceptually different) is the kinetic convex hull 
problem, where we are given n points which move continuously in the plane, and the goal is to main- 
tain their convex hull over time. Kinetic data structures have been introduced by Basch et al. [4] 
and received considerable attention in follow-up studies (see, e.g., [2] and the references therein). 
When the trajectories of the points are lines, our problem can be interpreted as a (perhaps, ex- 
tended and intricate) variant of the kinetic convex hull problem. Indeed, if the goal is to preprocess 
the linear trajectories such that the convex hull can be reported efficiently at any given time t, 
our algorithm applies (in which case the exact set of points P consists of their positions at time t) 
and yields a relatively simple solution. Nevertheless, our problem is more intricate than the kinetic 
convex hull problem for linear trajectories, as in our scenario there is no continuous motion that 
enables us to have a better control on the exact set of points (once they arrive) . 
Our results. We show that under a mild assumption (see Section 2.2) we can preprocess the input 
lines L such that given any set P of points, each of which lies on a distinct line of L, the convex 
hull CH(i-') can be computed in expected time 0{na{n)), where a(-) is the (slowly growing) inverse 
Ackermann function [ 41[ Chapter 2.1]; the expected running time is 0{na{n) log* n) without this 
assumption. Our data structure has quadratic preprocessing time and storage, and the convex hull 
algorithm is based on a batched randomized incremental construction similar to Seidel's tracing 
technique [40]. As part of the construction, we repeatedly trace the zone of (the boundary of) an 
intermediate hull in the arrangement of the input lines (see below for the definitions). The fact 
that the complexity of the zone is only 0{na{n)) (7||4l], and that it can be computed in the same 
asymptotic time bound (after having the arrangement at hand), is a key property of our solution. 
The analysis also applies when L is a set of line segments, and yields the same result. 

We also show that the analogous problem in which we just wish to sort the points according 
to their x-order imposes algebraic computation trees of depth J7(nlogn). Hence, in our setting 
convex hull computation is strictly easier than sorting, contrary to the "standard" (unconstrained) 
model, in which both problems are equivalent in terms of hardness (see, e.g., [6]). Our results 
can be extended with similar bounds to several related problems, such as determining the width 
and diameter of P, as well as time-space trade-offs and designing an output-sensitive algorithm. 
Unfortunately, already for the closest pair problem a preprocessing of the regions is unlikely to 
decrease the query time to o(n log n), demonstrating once again the delicate nature of our setting. 

In Section [3] we study a generalization of the problem under the dual setting. Specifically, we 
wish to preprocess a planar n-point set P such that given an integer k and a set L of lines, each 
of which is incident to a distinct point of P, we can find the "(< A;)-level" in the arrangement of 
L efficiently. We show a randomized construction whose expected running time is 0{na{n) + nk) 
under a mild assumption, and 0{na{n) log* n + nk) without this assumption. As above, our data 
structure has quadratic preprocessing time and storage. This improves over the 0{nlogn+nk) time 
algorithms in the traditional model 10 ,23], as long as A; = o(logn). Our approach is a non-trivial 
extension of the technique presented in Section [2| incorporated with the algorithms of Chan |10| 
and Everett et al. |23|, as well as the Clarkson-Shor technique |16| . 

The quadratic preprocessing time and storage might seem disappointing. However, a related 
lower bound by Ali Abam and de Berg ^ from the study of kinetic convex hulls (albeit providing 
a weaker evidence) suggests that quadratic space might be necessary, and that only relatively weak 



time-space trade-offs (as in Section 2.3) are possible in this model (see the discussion in Section 2.3 



for further details). Given the hardness of related problems, and the fact that previous approaches 
fail for "thin" regions, it still seems remarkable that improved bounds are even possible. 
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2 Convex Hulls 



Preliminaries. The input at the preprocessing stage is a set L of n lines in the plane. A query to 
the resulting data structure consists of any point set P such that each point lies on a distinct line 
in L, and for every point we are given its corresponding line. For simplicity, and without loss of 
generality, we assume that both L and P are in general position (see, e.g., j6||4l]). We denote by 
CH(P) the convex hull of P, and by E[P) the edges of CH(P). We represent the vertices of CH(P) 
in clockwise order, and we direct each edge e G E[P) such that CH(P) lies to its right. Given a 
subset Q C P, a point p £ P\Q, and an edge e S E{Q), we say that e is in conflict with p ii p lies 
to the left of the line supported by e. The set of all points in P \ Q in conflict with e is called the 
conflict list Cg of e, and its cardinality is called the conflict size Cg of e. 

In what follows we denote the arrangement of L by A{L), defined as the decomposition of the 
plane into vertices, edges and faces (also called cells), each being a maximal connected set contained 
in the intersection of at most two lines of L and not meeting any other line. The complexity of a 
face / in A{L) is the number of edges incident to /. The zone of a curve 7 consists of all faces that 
intersect 7, and the complexity of the zone is the sum of their complexities. 



2.1 The Construction 

Preprocessing. We construct in O(n^) time (and storage) the arrangement A{L) of L, and 
produce its vertical decomposition, that is, we erect an upward and a downward vertical ray through 
each vertex v of A{L) until they meet some line of L (not defining v), or else extend to infinity. 
Queries. Given an exact point set P = {pi, . . . ,pn} as described above, we obtain CH(P) through 
a batched randomized incremental construction. Let Pi Q P2 Q ■ ■ ■ Q ^log* n = be a sequence 
of subsets, where Pk-i is a random sample of Pk of size z^-i := min{ [n/ log^'^""^^ nj , n}, for k = 
2, ...,log*n0 This sequence of subsets is called a gradation. The idea is to construct CH(Pi), 
CH(P2)) • • CH(Piog*„) one by one, as follows. First, we have |Pi| = 0(n/logn), so it takes 0(n) 
time to find CH(Pi), using, e.g., Graham's scan j6]. Then, for k = 2,. . . ,log* n, we incrementally 
construct CH(Pfc) by updating CH(Pfc_i). This basic technique was introduced by Seidel pO] and 



it has later been exploited by several others 13,19, 38 



To construct CH(Pfc) from CH(Pfc_i), we use the data structure from the preprocessing to 
quickly construct the conflict lists of the edges in E{Pk-i) with respect to Pk- In the standard 
Clarkson-Shor randomized incremental construction [Tg] it takes 0(n log n) time to maintain the 
conflict lists. However, once we have the arrangement A{L) at hand, this can be done significantly 
faster. 

In fact, we use a refinement of the confiict lists: we shoot an upward vertical ray from each 
point on the upper hull of Pk-i, and a downward vertical ray from each point on the lower hull. 
Furthermore, we erect vertical walls through the leftmost and the rightmost points of CH(Pfc_i). 
This partitions the complement of CH(Pfc_i) into vertical slabs ^(e), for each edge e G E{Pk-i), 
and two boundary slabs S{vi), S{vr), associated with the respective leftmost and rightmost vertices 
vi and Vr of CH(Pfc_i). The refined conflict list of e, C*, is defined as C* := {Pk \ Pk-i) r\S{e). We 
add to this collection the sets C*^ := {Pk \ Pk-i) n S{vi) and C*^ := {Pk \ Pk-i) n S{vr), which we 
call the refined confiict lists of vi and Vr, respectively. Note that C| C Cg, for every e G E{Pk-i). 



^ Here, log*-'' n is the ith iterated logarithm: log'"' n — n and log''"' n — log(log'''' n). The standard notation 
log* n is the smallest k such that log'''' n < 1. 
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Moreover, C*^ (resp., C*^,) is contained in Cg^ U Cgj, where 61,62 G E[Pk_i) are the two respective 
edges emanating from vi (resp., Vr); see Figure [l]!|a). We now state a key property of the conflict 



Usts Ce (this property is fairly standard and follows from related studies [13 16 38| 



Lemma 2.1. Let Q he a planar m-point set, r a positive integer satisfying 1 < r < m, and Q a 
random subset of size r. Suppose that /(•) is a monotone non- decreasing function, so that f{x)/x'^ 
is decreasing, for some constant c > 0. Then 

Exp[ /(ce)] =0(r- f{m/r)), 

eeE{R) 

where the constant of proportionality depends on c, and Cg is the number of points p £ Q \ R in 
conflict with e £ E{R). □ 

In other words, the above lemma implies that, on average, the size of the conflict list of a fixed 
edge e G E{R) is m/r (this can easily be seen by setting /(•) to the identity function, and obtaining 
an overall linear size). 

Constructing the refined conflict lists. We next present how to construct the refined conflict 
lists at the k-th round of the algorithm. We first construct, in a preprocessing step, the refined 
conflict lists C*^, C*^ in overall 0{zk) time. We call these points the extreme points, and for the 
sake of the analysis, we eliminate these points from for the time being, and continue processing 
them only at the final step of the construction — see below. 

Let UH(Pfc_i) be the upper hull of Pk-i, and let LH(Pfc_i) be its lower hull. Having these 
structures at hand, we construct the zones of UH(Pfc_i) and LH(Pfc_i) in A{L). This takes overall 
0{na{n)) time, using the vertical decomposition of A{L) and the fact that the zone complexity of 
a convex curve in a planar arrangement of n lines is 0(na(n)); see Bern et al. [T] and Sharir and 
Agarwal [41| Theorem 5.11]. 

As soon as we have the zones as above, we can determine for each line £ £ L the edges e G 
E{Pk-i) that (. intersects (if any). Let Li be the lines that intersect CH(Pfc_i), and put L2 := L\Li. 
(At this stage of the analysis, we ignore all lines corresponding to points in Pk that were eliminated 
at the time we processed the extreme points.) 

Next, we wish to find, for each point p G Pk \ Pk-i the edges in E{Pk-i) in conflict with p. If 
p lies inside CH(Pfc_i), there are no conflicts. Otherwise, we efficiently flnd an edge Cp G E(Pk-i) 
visible from p, whence we search for the slab S{e*) containing p — see below. 

Let us flrst consider the points on the lines in Li. Fix a line i € Li, let p & P he the point on 
i, and let gi, q2 be the intersections between £ and the boundary of CH(Pfc_i). The points qi, q2 
subdivide £ into two rays pi, p2, and the line segment qTq2- By convexity, qiq2 C CH(Pfc_i) and 
the rays pi, p2 lie outside CH(Pfc_i). Hence, if p lies on qTq2, it must be contained in CH(Pfc). 
Otherwise, p sees an edge of E^P^-i) that meets one of the rays pi, p2, and we thus set Cp to be 
this edge (which can be determined in constant time); see Figure [T]^b) . 

We next process the lines in L2. Note that all points on the lines in L2 conflict with at least 
one edge in E{Pk-i), since no line in L2 meets CH(Pfc_i). To find these edges we determine for 
each £ £ L2 a vertex pi on the boundary of CH(i-*jt_i) that is extreme for £^ This can be done in 
total time 0{n) by ordering E{Pk_i) and L2 according to their slopes (the latter being performed 



By this we mean that pi is extremal in the direction of the outer normal of the halfplane that is bounded by 
and contains CH(Pfc_i). 
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(a) (b) (c) 



Fig. 1: (a) The conflict list Ce of the edge e S E(Pk-i) contains ah the hghtly-shaded points, 
whereas the refined conflict hst C*(e) has only those points in the vertical slab S{e); (b-c) 
The edge Cp of E{Pk-i) is visible to p when (b) £ intersects CH(Pfc_i), or (c) i does not meet 
CH(Pfc_i). In this case pi is an extreme vertex for the direction it, and the two dashed 
lines depict the visibility lines between p and the two respective endpoints of Cp. 

during preprocessing), and then merging these two lists in linear time. Next, fix such a line i €z L2, 
and let p G ^ be a query point, then p must see one of the two edges in E{Pk_i) incident to pi 
(which can be determined in constant time given p^), and we thus set Cp to be the corresponding 
edge; see Figure [T](c) . 

We are now ready to determine, for each point p G Pk outside CH(Pfc_i), the slab 5(e*) that 
contains it (note that e* must be vertically visible from p). If Cp is vertically visible from p, we set 
e* := Cp. Otherwise, we walk along (the boundary of) CII(Pfc_i), starting from Cp and progressing 
in the appropriate direction (uniquely determined by p and Cp), until the slab containing p is 
found. Using cross pointers between the edges and the points, we can easily compute C* for each 
e G E{Pk-i). By construction, all traversed edges are in confiict with p, and thus the overall time 
for this procedure is proportional to the total size of the confiict lists Ce- Recalling that Ce = |Ce|, 
we obtain 



Exp[ 



ee£;(Pft_i) 



0{zk) = 0{n) 



by Lemma |2.1| with f : m ^ m. This concludes the construction of the refined conflict lists. 
Computing CH(i-'jt). We next describe how to construct the upper hull of Pyt, the analysis for the 
lower hull is analogous. Let (ei, . . . , eg) be the edges along the upper hull of Pk-i, ordered from 
left to right. For each ei, we sort the points in C|. according to their x-order, using, e.g., merge 
sort. We apply the same procedure for the extreme points. We then concatenate the sorted lists 
C*^,C*^,C*^, . . . ,C*^,C*^, and merge the result with the vertices of the upper hull of Pk-i- Call 
the resulting list Q, and use Graham's scan to find the upper hull of Q in time 0(|(5|)- This is 



also the upper hull of P^- Applying once again Lemma 2.1 with f : m ^ mlogm, and putting 
■= 1^*11, c*j := |C'vJ) c*^ '■= |C'«rl' ^ > an absolute constant, the overall expected running 
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time of this step is bounded by 



Exp [A ■ {cl^ log cl^ + cl^ log cl^ + ^ cl log c* )] 

eeE(Pfe_i) 

<Exp[3A- ^ CelogCe] = 0{zklog{zk/zk-l)) 
ee£;(Pfc_i) 

= O (^(n/logC^^njlog (log('=-^)n/log('=)n)) = 0(n), 

because by definition C* C Ce, so c* < Ce, and c*^ < + for two edges ei, 62 of E{Pk-i) (and 
similarly for c*^). In total, we obtain that the expected time to construct CH(Pfc) given CH(Pfc„i) 
is 0{na{n)), and since there are log* n iterations, the total running time is 0{na{n) log* n). 

We note that the analysis proceeds almost verbatim when L is just a set of line segments in the 
plane. In this case, we preprocess the lines containing the input segments, and proceed as in the 
original problem. We have thus shown: 

Theorem 2.2. Using 0{n^) space and time, we can preprocess a set L of n lines in the plane 
(given in general position), such that given any point set P with each point lying on a distinct line 
in L, we can construct CH(P) in expected time O {na{n) log* n). The same result holds if L is a 
set of n line segments whose supporting lines are in general position. 



Remark. An inspection of the proof of Theorem 2.2 shows that the total expected conflict 
size, over all iterations k, is only 0{n). 

2.2 Better Bounds for Oblivious Points 

We now present an improved solution under the obliviousness model, where we assume that the 
points are oblivious to the random choices during the preprocessing step. Specifically, this implies 
that an adversary cannot pick the point set P in a malicious manner, as it is not aware of the 
random choices at the preprocessing step. This fairly standard assumption has appeared in various 
studies (see, e.g., [H[Il]). In the discussion at the end of this section we describe this issue in more 
detail. 

Preprocessing. We now construct a gradation i^i ^ ^ • • • C Li+iogiogn = L oi the lines 
during the preprocessing phase, where the set sizes decrease geometrically. Specifically, |Li| = yi = 
\nj log n] , and for = 2, . . . , 1 + log log n, is a random subset of of size 







2 





1 n 



2 'j\o^\o^n—k-'r\ 



(1) 



We construct each arrangement AiL}^ in 0(n^2^'^^^/ log^ n) time, for a total O(n^) time over all 
gradation steps. 

Query. Given an exact input P, we first follow the gradation produced at the preprocessing stage, 
and generate the corresponding gradation Pi C P2 ^ • • • ^ A+iogiogn = P ^ where Pfc = P Ci L^, 



for all k. By the obliviousness assumption, each Pk-i is an unbiased sample of P^, so Lemma 2.1 
applies (see once again the discussion below). Moreover, the key observation is that in order to 
obtain CH(Pfc) from CH(Pfc_i), it suffices to confine the search to the arrangement A{Lk) instead 



of the entire arrangement A{L) as in Section 2.1 Thus, we first construct CII(Pi) in 0{n) time 



as before. Next, to obtain CH(Pfc) from CH(Pfc_i), we construct the zones of UH(Pfc_i) and 
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Fig. 2: Illustrating the obliviousness assumption. The set L contains all the lines in the figure. The 
set V of the dashed lines depicted in the figure is a random subset of L. If the adversary 
knows this random subset L', it can place the points pi, . . . as illustrated in the figure, 
thereby constructing CH(pi, ■ ■ ■ ,Pa) first. Then each edge on the upper hull UH(pi, . . . ,^4) 
is in conflict with each of the remaining points ps, . . . ,p8. 



LH(Pfc_i) in A{Lk) in 0{yka{yk)) time, and then compute the refined confiict lists just as in 
Section |2.1[ The overall expected time to produce these lists is 0{yk), totaling 0{n) over all 
steps. As in Section [2?H the expected time (of the final step) to compute Ull(Pfc) and LH(Pfc) is 

0{yklog{yk/yk^i)) = 0{yk), 

by ([1]). Thus, the expected running time at the A;th step is dominated by the zone construction, 
so the overall expected running time is 

O (^l^fc=i°^^°^" yA:a(yA:)^ = O {na{n)), as is easily verified. Thus, 

Theorem 2.3. Using 0{n'^) space and time, we can preprocess a set Lofn lines in the plane, such 
that for any point set P with each point lying on a distinct line of L, we can construct CII(P) in 
expected time 0{na{n)) assuming obliviousness. 

Discussion. The issue captured by the obliviousness assumption is: how much does the adversary 
know about the preprocessing phase? If the adversary manages to obtain the coin flips performed 
during the preprocessing stage, then this enables a malicious choice of the input. This phenomenon 
is particularly striking in the case of hashing: if the adversary knows the random choice of the 
hash function, a bad set of inputs can hash all keys to a single slot, completely destroying the hash 
table. On the other hand, if the adversary is oblivious to the hash function, the expected running 



time per operation is only 0(1); see, e.g., 18, Chapter 11]. 

In our model we encounter a similar phenomenon. Even though the impact is not as disastrous 
as for hashing, assuming obliviousness for the adversary can improve our running time by a factor of 
0(log* n). To illustrate the effect of obliviousness in our setting, consider the scenario illustrated in 
Figure Ij In this case, we have a set L of lines and a random subset L' C L. The adversary can pick 
the point set P so that P' := P n L' is a biased sample of P in a sense that violates the properties 
of Lemma [2. l[ In particular, the total number of conflicts between the edges of CH(P') and P may 
become quadratic, which makes the random incremental construction inefficient. Nevertheless, if 
the adversary is oblivious with respect to the sample, the points in P' behave as an unbiased sample 
of P. 
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2.3 Extensions and Variants 



Diameter- and width-queries. Given CH(P), we can easily compute the diameter (i.e., a pair of 
points witli maximum Euclidean distance) and the width of P (a strip of minimal width containing 
all the points in P) in linear time (see, e.g., |37| Chapter 4]). Hence, 



Corollary 2.4. Using O(n^) space and time, we can preprocess a set L of n lines in the plane, 
such that given a point set P with each point lying on a distinct line of L, the diameter or width 
of P can be found in expected time 0{na{n) log* n). The expected running time becomes 0{na{n)) 
assuming obliviousness. 

A trade-ofF between space and query time. Our data structure can be generalized to support 
a trade-off between preprocessing time (and storage) and the query time, using a relatively standard 
grouping technique (2||9], described as follows. 

Preprocessing. Let 1 < m < n be a parameter, and, without loss of generality, assume that 
n/m is an integer. We partition L into m subsets Li, . . . ,Lm of size n/m each, and construct the 
arrangements ^(L^.), for k = 1, . . . ,m, in overall time and storage 0{n^ /m) (cf. (2||9]). 
Query. Given an exact input P, we first construct CH(Pfc), where Pk is the subset of points 
on the lines in Lk, k = l,...,m, in 0{{n/m)a{n/m) log* (n/m)) (assuming obliviousness it is 
0{{n/m)a{n/m))) expected time, for a total expected time of 0(na(n/m) log* (n/m)) (resp., 
0{na{n/m))) over all these subsets. Having CH(Pfc) at hand for all k, we merge UH(Pi), 
UH(Pm) in 0(n log m) time [l8], thereby producing a list Q of points sorted according to their 
x-order. We then use Graham's scheme to construct the upper hull of Q (and thus of P) in 0(|Q|) 
time. We produce the lower hull of P in an analogous manner. We have thus shown: 

Corollary 2.5. Fix 1 < m < n. In total 0{n'^/m) time and space, we can preprocess a set L of n 
lines in the plane, such that given a point set P with each point lying on a distinct line of L, we can 
construct CH(P) in expected time 0(n(logm + a(n/m) log* {n/m))). The running time becomes 
0{n{logm + a{n/m))) assuming obliviousness. 



Note that for small values of m, Corollary 2.5 in fact yields an improvement over Theorem 2.2 



Spe cifica lly, by setting m := 2°("'\ we have that the space and preprocessing requirement in Theo- 
can be lowered to 0(n^/2"('^)), while the expected query time remains 0{na{n) log*(n)). 



rem 



2.2 



Discussion. As noted in the introduction, the bounds in Corollary |2 . 5| are somewhat disappointing. 
However, the study by Ali Abam and de Berg [2] might provide (albeit, weak) evidence that these 
bounds are unlikely to be improved. Indeed, they have studied the kinetic sorting problem, where 
we are given a set of n points moving continuously on the real line, and the goal is to maintain 
a structure on them so that at any given time the points can be sorted efficiently. Ali Abam and 
de Berg [2] showed that even when the trajectories of these points are just linear functions, then 
under the comparison graph model (see [2] for the definition) one cannot answer a query faster 
than cn log m time using less than c'n^/m preprocessing time and storage, for appropriate absolute 
constants c, c' > 0. As discussed in [2], this may indicate that better trade-offs for the kinetic convex 
hull problem seem unlikely. Nevertheless, it may not provide a rigorous proof, as the analysis for 
the kinetic sorting problem strongly relies on the one-dimensionality of the points, and does not 
work for points in the plane, at least under the context of the proofs given in p]. Still, we have 
chosen to present those details in this paper, as we tend to believe that bounds of this kind could 
also apply to our problem (which is even more difficult than the kinetic convex hull problem, as 
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described in the introduction), and that a rigorous analysis could stem from the approach in [2]. 
This would imply that the trade-off bounds given in Corollary |2.5| are nearly optimal. 
An output-sensitive algorithm. Our algorithm can be made sensitive to the size h of the convex 
hull by adapting a technique of Ali Abam and de Berg [2] that uses gift wrapping queries. The 
setting for queries of this kind is as follows. Let Q be a point set, given an arbitrary point p (not 
necessarily from Q) and a line £ through p, such that all points of Q lie on the same side of £, report 
a point q £ Q that is hit first when i is rotated around p (say, in clockwise direction). 
A search on the value of h. Since the output size h is not given in advance, we perform a search 
on its actual value, over at most log* n — 1 iterations, in the query step, and apply all tested values 
at the preprocessing step, as described below. The tested values of h are chosen in the following 
manner. Let hi be the value of h at the ith round. Initially, hi = 1, and put hi = 2^*"^^ for i > 2, 
where 2^ ') is the power-tower function. We continue the search as long as hi < logn; let t be the 
number of rounds thus obtained. By construction t < log* n — 1. When hi > logn, we stop the 
search and resort to the bound in Theorem 12.21 — see below. 
Preprocessing. At each round i = 1, . . . ,t, we set a parameter rui to be 

J n 

rui := maxi 1, — 

[ hi log hi 

(i) (i) 

and partition L into rui roughly equal subsets L\ , . . . , ■ We then proceed in a similar manner 
as described earlier for the trade-off between space and query time. That is, for each /c = 1, . . . , m^, 
we construct the arrangement A{L^^) in overall time and storage 0{r? jmi). 
The total time and storage consumed over all rounds i is thus 



O [ + ^ n/ij log hi ] = Oir? 



\ i=l / 

since the sum over the rounds i is dominated by the last term, which is 0(n logn log logn). 
Query. Given a point set P with each point lying on a distinct line of L, we construct CH(P) in 
an output-sensitive manner, as follows. 

At the ith round, let P^^ := P n hf , for A; = 1, . . . ,m,. Construct CH(p|^^), CH(P2^*^), . . ., 
CH(Pm]), as in Section 2.1 This takes total time 0(na(n/mi) log* (n/nij)) = 0(na(/ij) log*(/ij)) 



(resp. 0{na{n/mi)) = 0{na{hi)) assuming obliviousness). 

The primitive operation we would like to obtain is a gift wrapping query on P. To this end, we 
perform standard gift wrapping queries for each subset P^^ in 0(log {n/mi)) time (see, e.g., pT]). 
This yields a set of rrii candidates, from which we produce the final answer to the query. In total, 
a gift wrapping query takes 0(mi log (n/mj)) = 0(n//ij) steps. 

We now attempt to construct CII(P). We begin with a gift wrapping query for the leftmost 
vertex p of P and the vertical line £p passing through p. This yields a pair {p',i'), where p' is the 
first point hit by £, and i' is the line through p and p' . We continue until (i) we hit p again, or (ii) 
we have performed hi gift wrapping queries. This results in a running time of 

O (hi ■ nijlog {n/rrii)) = O ( /i, • ^ ) = 0(n). 



The round succeeds if we reach p. Otherwise it fails, and we proceed to round i + 1. After 



t < log* n — 1 unsuccessful rounds (i.e., if hi > logn), we compute CH(P) directly via Theorem 2.2 
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Fig. 3: The 2-level in an arrangement of lines. 



It is easy to verify that the actual number of rounds that we need is at most 0(log* h). Combin- 
ing the bounds above, it follows that the overall query time is 0(na(/i) (log* /i)^) (or 0{na{h) log* h) 
assuming obliviousness), as asserted. We have thus shown: 

Corollary 2.6. In total 0{n'^) time and space, we can preprocess a set L of n lines in the plane, 
such that given a point set P with each point lying on a distinct line of L, CH(P) can be found in 
expected time O {na{h) {log* h)'^), where h is the output size. The expected running time becomes 
0{na{h) log* h) assuming obliviousness. 



3 Levels in Arrangements 

Preliminaries. Let L be a set of n lines in the plane (in general position). Given a point p, the 
level of p with respect to L is the number of lines in L intersected by the open downward vertical 
ray emanating from p. For an integer fc > 0, the k-level of the arrangement A{L), denoted by 
levfc(L), is the closure of all edges of A{L) whose interior points have level k with respect to L. 
It is a monotone piecewise- linear chain. In particular, levo(L) is the so-called "lower envelope" of 
L; see, e.g., 41, Chapter 5.4] and Figure [sj The (< k)-level of A{L), denoted by lev<fc(L), is the 



complex induced by all cells of A{L) lying on or below the fc-level, and thus its edge set is the union 



of levj(L) for i = 0, . . . , A:; its overall combinatorial complexity is 0{nk) (see, e.g., 16,|4l|). 

In what follows we denote by Vq{M) (resp., V<q{M)) the set of vertices of leVg(M) (resp., 
lev<g(M)), where g > is an integer parameter and M is a set of lines in the plane. It is easy to 
verify that the combinatorial complexity of leVg(M) is at most 0(1 + 1 V^(M)|) (see once again 



41 



Throughout this section, we use the Vinogradov-notation: f g means / = 0{g) and f ^ g means 
/ = ^{g). In addition, we write Expj(^[-] to emphasize that we take the expectation with respect 
to the random choice of X (the other variables are considered constant). 

The best currently known bound for the worst-case complexity of levfc(L) is 0{nk^^ ^) |2l| . 
Nevertheless, since the overall combinatorial complexity of, say, lev<2A;(-L) is only 0{nk) [16], it 
follows that the average size of Vi[L), for each i = k,. .. ,2k, is only 0{n). Specifically, we have 
(see also f23lj for a similar property): 

Claim 3.1. Let k be a random integer in the range {k, . . ., 2k}. Then, for any subset S C L, we 
have 

Exp^[|14(S)|]«|5|. 
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Proof. The claim follows from the observation that the total size of V<2k{S) is 0{\S\k), and each 
vertex appears in exactly two consecutive levels of A{S). □ 

The problem. In the sequel we study the following problem. We are given a set P = {pi, . . . ,Pn} 
of n points in the plane (in general position), and we would like to compute a data structure such 
that, given any set L = {ii, . . . , in} of n lines satisfying pj G ^j, for z = 1, . . . , n, and any parameter 
A; > 0, we can efficiently construct lev<fc(L). This is a natural generalization of the problem studied 



in Section 2.1 Indeed, let us apply the standard duality transformation, where a line / : y = ax + b 
is mapped to the point /* = (a, —b), and a point p = (c, d) is mapped to the line p* : y = cx — d 
(see, e.g., ^ Chapter 8]). Then levo(-L) in the "primal" plane is mapped to the (upper) convex hull 
of the points L* in the "dual" plane. Everett et al. [23] showed that lev<fc(L) can be constructed 
in 0(n log n + nk) time, and that this time bound is worst-case optimal (see also pO]). We show: 

Theorem 3.2. Using 0{'n?) space and time, we can preprocess a set P of n points in the plane, 
such that given a set L of lines with each line incident to a distinct point of P, lev<fc(L) can be 
computed in expected time 0(na(n) (log* n — log* A;) + nk). The expected running time becomes 
0{na{n) + nk) assuming obliviousness. 



Theorem |3.2| improves the "standard" bound of 0{nlogn -\- nk) for any k = o(logn). We 
combine ideas from Chan's algorithm for constructing (< /c)-levels in arrangements of planes in 



M"^ [lOj with the technique of Everett et al. 23 . The preprocessing phase is fairly simple, but the 
details of the query processing and its analysis are more intricate. We begin with an overview of 
the approach, and then describe the query step and its analysis in more detail. 
An overview of the algorithm. The main ingredients of the algorithm are as follows. 
Preprocessing. Compute the arrangement A{P*) of the lines dual to the points in P (and produce 
its vertical decomposition) in O(n^) time and storage. 

Query. We are given a set of lines L as above, and an integer k > 0. If > logn we use the 
algorithm of Everett et al. ^23j to report lev<fc(L) in 0{n\ogn + nk) = 0{nk) time. Otherwise, we 
compute a gradation Li C L2 C • • • C Liog* n-iog* fc+i C L of L. The sizes of the subsets Li are 
similar to those presented in Section [2. 1| for the dual plane, but as soon as the number of lines in a 
subset of the gradation exceeds [n/A], we complete the sequence in a single step by choosing the 



next subset to be the entire set L. As in Section 2.1, we set \Li\ := [n/logn]. 

We choose a random integer k G {k, . . . , 2k}. Then, at the first iteration, we construct lev^^(-Li) 
in 0{nk) time, using the algorithm in [23]. At each of the following iterations we construct 
lev^^(Lj) from lev^^(Lj_i) (at the final step, we construct lev^^(L) from ^j.[L\og* n-\og* k+i)) ■ 
As observed above, the random choice of k guarantees that the expected complexity of each lev^(Li) 
is only linear in |Lj|,|^for each z = 2, . . . , log* n — log* fc + 1, which is crucial for the analysis. Finally, 
we eliminate from lev^^(L) all portions lying above the (actual) fe-level, in order to obtain the final 
structure lev<fc(L). 

To construct lev^^(Lj) from lev^^(Li_i), we would like to proceed as follows. We compute 
UII(lev^^(Lj_i)) and subdivide it into semi-unbounded (in the negative y-direction) trapezoidal 
cells. The first goal is to find for each such cell A the set of lines Ca ^ Li which are in conflict 
with A (that is, A n £ / 0, for each I G Ca)- This goal is achieved by mapping UH(lev^^(Lj_i)) 

^ We use the same value of k throughout the entire process, since the expected complexity of the fc-level remains 
linear in each iteration i. By linearity of expectation, the overall expected size of the various fc-levels is linear in 

^log* n — log* k + 1 I 



12 



to the dual plane and walking along its zone in A{P*). The dual of UH(lev^^,(S')) is a concave 
chain 7 (the lower envelope of the lines dual to the vertices of UH(lev^^(S'))), where each vertex 
V of UH(lev^^(5)) is mapped to an edge v* of 7 and each edge e is mapped to a vertex e* of 7. 
Moreover, a line £ € L below a vertex v of UH(lev^^(S')) is mapped to a point i* (on some line of P*) 
above the corresponding edge v* of 7. As is easily verified, such a line i intersects UH(lev^^(S')). 
Otherwise, if i lies above all the vertices of UH(lev^^(5')), then if] UH(lev^^(S')) = 0, and this 
implies that £* lies below 7 in the dual plane. See Figure |4]^a)-(b). 

Having the lists Ca at hand, we construct for each A the structure lev^^(Lj) clipped to A 
by (i) constructing lev^(CA) (clipped to A); (ii) clipping each line £ G Ca to its portion that lies 
below UH(lev^(CA)n A); (iii) constructing the arrangement of these portions within A (as observed 
the actual level of these portions in A{Li) does not exceed 2k — 1); and (iv) eliminating 



m 



23 



from the arrangement just computed all portions lying above lev^(CA) n A. Finally, we glue the 
resulting structures together and report lev^^(Lj). 

However, it would be too expensive to process each conflict list Ca individually. Therefore, a 
crucial ingredient of the algorithm is to consider blocks instead of just individual cells. Specifically, 
we gather contiguous cells into blocks and process them all together. This partition is the key to 
reducing the number of cells considered in the update step; see Figure [5| The bulk of the analysis 
lies in a careful balancing between the block sizes and their overall number, and in particular 
showing that blocks with large confiict lists are scarce. 

3.1 Query Processing 

We now describe the query process and its analysis in more detail. We first follow a gradation as 
described in the overview, and then proceed to the update step. 

The update step. From now on we fix an iteration i > 1, and, with a slight abuse of notation, 
put S := -Li_i and L := Li. Let p := jSI/lLl. By definition, S is a random sample of L of size 
[n/ log*-*"^^ n] = p\L\. Given lev^^(S'), we first construct UH(lev^^(5)). 

Claim 3.3. The overall expected time to construct UH(lev^^(S')) is 0{\S\). 

Proof. Using easy manipulations on the DCEL representing lev^^(5'), we can first locate a vertex 
V of lev^(5'), and then proceed to its neighboring topmost vertex (say, to its left) by walking along 
its corresponding adjacent edge. We then continue progressing in this manner to the left. The 
vertices to the right of v are explored analogously. Thus we can extract the sequence of vertices 



(and edges) along lev^(5), ordered from left to right. By Claim 3.1, its expected size is OdS"!) 



Then we use Graham's scan on the resulting set of vertices. □ 

Next, we shoot vertical rays from each vertex of the hull UH(lev^^(5)) in the negative y- 
direction. This gives a collection 7^ ^ of semi-unbounded trapezoidal cells covering UH(lev^^(5)), 
and hence also lev^^(L), as is easily verified (see, e.g., 35 for similar arguments). We group 



the cells in 7^ ^ into 0{\T^ g\{p/k)) semi-unbounded vertical strips, each of which consists of k/p 
contiguous cells. Such a vertical strip is called a block. Every block is bounded by a convex chain 
from above, and by two vertical walls, one to its left and the other to its right. Let B be the set of 
all blocks. 

We say that a line £ G L is in conflict with a cell A G 7^5, Af] £ ^ f/i. The confiict list Ca 
is then the set of all lines £ £ L in conflict with A, and we put ca := |Ca|. We similarly define 
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(a) 



(b) 



Fig. 4: (a) The fc-level of A{S) is depicted by the hghtly-shaded polygonal line in the figure, and 
UH(lev^^(5')) is depicted by the dashed line. The line i' passes above UH(lev^^(S')), where 
i passes below v and thus meets UH(lev<^(S')). (b) The dual scene of (a). The concave 
chain 7 is the dual of UH(lev^^(S')). The line I is mapped to the point £* , where the pair 
of the dashed lines depict the visibility lines of i* to 7. The line i' is mapped to the point 
i'* lying below 7. 



conflict lists Cb and conflict sizes cb for each block B G B. Our next goal is to determine the 
conflict lists Cb for each block. 

Lemma 3.4. We can construct the conflict lists Cb, B G B, in overall time 

O na( 



<n) + \V-,iS)\ + Y,CB) 
BeB J 



Proof. First, we determine for each line i £ L one trapezoid G 7^ ^ such that i £ Ca^, if such 
a Ai exists. This is done by a walk in the dual plane, as described in the overview above and 
illustrated in Figure|4j Specifically, we dualize UH(lev^^(5')) to a concave chain 7. Using a similar 



technique as in Section 2.1 we walk along the zone of 7 in A{P*) in order to determine, for each 
point i* corresponding to a line i £ L, its orientation with respect to 7. When i* lies above 7, we 
find an edge of 7 that is visible from £* . Using the corresponding vertex V£ in the primal plane, 
we can determine a cell that is intersected by i. 

Next, we determine for each such line i a block B that conflicts with it, namely the block that 
contains A^. We then find all blocks B' with i £ Cb' through a bidirectional walk from B. That 
is, we can determine if i intersects the next block by checking whether £ intersects any of its walls 
(otherwise, £ intersects its convex chain). See Figures] 



The bound on the running time now follows using similar considerations as in Section 2.1 □ 



Our next goal is to determine the (< A;)-level clipped to B, for each B £ B. To this end, we use 
a variant of the technique of Everett et al. ^23j . 

Lemma 3.5. Let B £ B. The (< k)-level of L clipped to B can be constructed in time 0{cb logCB + 
{ruB + Cb) log^ k + a^), where cb ■= \Cb\, tub ■= |V'^(C'b) n B\, and ob is the number of vertices 
ofA{L) below UR{lev-^{CR)nB). 

Proof. We apply the algorithm of Cole et al. |17j in order to construct lev^(CB) H -B in time 
0{cb log Cb + {ruB + cb) log^ k). Note that this algorithm returns levr (C^) n B as an x-monotone 
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Fig. 5: The block B is depicted by the hghtly-shaded region. The hne li does not intersect any 
neighboring block, whereas £3 and £4 also meet the left neighbor of B. The line £2 meets 
both neighbors. 



polygonal chain C, ordered from left to right Next, we determine for each line ^ € its first and 
last intersections wi, W2 with C. (if they exist). Clearly, the portion of £ below UH(^) is either (i) 
the line segment W1W2 (if both intersections exist); (ii) a ray with an endpoint at wi (if wi is the 
only intersection with UH((^)) or (iii) the full line £ clipped to B (if it lies fully below 

These intersections can easily be determined in 0{mB) time by walking along C and recording for 
each line £ the first and last vertices of C that are incident to £ (if they exist); at the representation 
of we also store the incident lines within each vertex. A line that is not encountered during 
this process, does not meet and we can easily check whether it lies below As observed above, 
each of these portions (clipped to B) is either the (full) line £, a ray, or a line segment. Let C'^ 
be the resulting set of these portions; by construction, c'^ := \C'^\ < cb- Having this collection 
at hand, the computation of lev^^(Cs) n S is almost straightforward. Indeed, we use an optimal 



line segment intersection algorithm 12 , 16 in order to compute the arrangement of C'^ in time 
proportional to c'^ logc'^ + ub ^ cb logc^ + a^, where ub is the number of intersections between 
the elements of C'^. Note that some of these intersections may lie above the fc-level, as they are 
only guaranteed to be contained in UH((^). Thus, at the final step of the construction we eliminate 
such portions of the arrangement. This produces lev^^(CB)ni? = lev^^(L)ni?. A key observation 
is the fact that all these portions are actually contained in lev^^2fc_i)(C'_B) H B — see below. □ 

Finally, we glue all the resulting structures together and report lev^^(L). 



3.2 The Analysis 

We phrase our analysis below for a random subset 5 of L with |5| = p\L\, for some p € (0, 1), as 
the value of p varies at each iteration of the algorithm (as well as the final step). We begin with the 
following key lemma that bounds the total size of the large conflict sets. The proof is postponed 
toE 

Lemma 3.6. Let k, L, S, Tj, g, Ca, ca, p be defined as above. Then, for any sufficiently large 

^ The algorithm of Cole et al. proceeds with a rotational sweep in the dual plane that keeps k points of the input 
to the left of the sweep-line. In order to find only those vertices of the fc-level which lie inside B, we need to identify 
the appropriate initial orientation for this line, but this is easily done by inspecting the intersection of Cs with the 
left boundary of B and using a linear time selection algorithm Il8l Chapter 9]. 
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constant (3 > 1, we have 



Exp. 



^ CA (log c A + log^ A; + log ( 1 /p) ) 



k,S 



< \L\e~®'^^^^'^/P\ 



(2) 



Remarks. (1) The bound in Lemma 3.6 holds for any integer k G {k . . . 2k}. In particular, the 
analysis does not assume a linear complexity bound on any of the levels of A{S) (and A{L)); see 
Elfor further details. 

(2) It is easy to verify that the bound in Lemma 3.6 can be rewritten when we apply the summation 
over all blocks. That is, 



Expc 



^ CA(logCA + log^A; + log(l/p)) 



(3) 



Bounding the expected running time. Adding the bounds in Lemmas 3.4 and 3.5, the running 
time to construct lev^^(L) from lev^^(5) is asymptotically upper-bounded by 



na 



(n) + \V-^iS)\ + CBilogCB + log^ k) + niB log^ k + ae 
BeB 



(4) 



We bound each summand in turn. By Claim 3.1, we have Exp^[| V^(5)|] ^ \S\ < \L\ 
Claim 3.7. We have: 



Exp 



S,k 



CB(logCB + log^ k) 



.BeB 



< \L\ (log(l/p) + log2 k) 



Proof. We say that a line £ £ Cb is spanning for a block B €z B li i intersects both the left and 
the right walls of B, otherwise it is non-spanning. Note that every line can be non-spanning for at 
most two blocks. Let /3 > 1 be a sufficiently large constant. We say that a block B G B is light if 
it has at most fik/p^ spanning lines and if cb < fik^ /p^ . Otherwise, B is called heavy. 
We split the summation, as follows: 



^ cb {log CB + log^ k^ = ^ CB (log CB + log^ k^ + ^ CB (log CB + log^ 



k 



BeB 



BeB 

B is light 



BeB 
B is heavy 



(5) 



Let us first consider the sum over the light blocks. Let i? be a light block. By definition, we have 
logc^ ^ log(l/p) + log /c. Furthermore, write = + c^, where is the number of spanning 
lines in Cb , and is the number of non-spanning lines in . Observe that 

5^ c%«\B\Ck/p% 
BeB 

B is hght 

since each light block can have only 0{k/p^) spanning lines, and that 

B is Ught 
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since the total number of non-spanning lines is at most 2|L|, over all blocks in It follows that 



^ CB (log CB + log^ kj <C 



B is light 



See 

\B is light 



log(l/p) + log^ k 



« + log(l/p)+log2^ . 



Now we bound Since each block contains p//c contiguous cells, we have <^ IT^^Kp/^)- 

Furthermore, because the number of cells is bounded by the number of vertices on the /c-level, we 
get 

ExP5fc[|r^5l] «Exp^^[|y^(5)|] «Exps[|5|] = \L\p, 



using Claim 3.1 and the definition of p as |S'|/|L|. Thus, 

E^Ps-,[\B\Ck/p')] « E^pg-,[\rj,J/p] « |L|. 

Therefore, 



Exp 



S,k 



B is light 



CB(logCB + log^ k) 



< \L\ (log(l/p) + log^ k) . 



To bound the sum over the heavy blocks in ([s]), observe that by definition a heavy block B 
must contain a cell A with ca > (ik/p^: either there are more than fik/p^ spanning lines, in which 
case all the cells in B have this property, or cb > (3k'^/p^, in which case the claim follows from the 
fact that B contains only k/p cells. Let A* G -B be the cell that maximizes ca for A £ B. Clearly, 
we have ca* > Pk/p^ and cb < {k/p)c^*. Hence, 



BeB 

B is heavy 



^ CB flog CB + log^ kj ^ (k/p) ■ CA f log (cAk/p) + log^ k 



BeB _ 

AeB,CA>l3k/p'^ 



< (k/p) CA (log CA + log^ k + log(l/p)) . 



B£B 
AeB,CA>l3~k/p^ 



By Q, the expectation (over S) of the latter sum is at most 

(fc/p)|L|e-®(^)(^-/P) < |L|, 

for f3 sufficiently large. □ 

Remark. In the analysis of Lemma |3.7| concerning the bound for the heavy blocks, each such 
block B may consist of both heavy cells (that is, cells A with ca > fik/p^) and light ones. At 
first glance, one may suspect that the overall contribution of the light cells should have the bound 
0(|L|(log(l/p) +log^/c)), as obtained in the case for light blocks. Nevertheless, since these cells 
belong to a heavy block, the actual bound is smaller, and in fact follows from the property that the 
number of heavy blocks is eventually much smaller than the number of light blocks (this property 



is an easy consequence of Lemma 3.6). 
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Claim 3.8. We have: ^^ggm^log^A; ^ \L\k. 

Proof. Recall that itlb = \V^{Cb) H B\ = \V^{L) n B\. Using Dey's bound on the size of the 
, it follows that ^BeB^B = ^ The claim is now immediate. □ 



/c-level 



21 



Claim 3.9. We have: X^BeB^^ ^ \L\k. 

Proof. Everett et al. |23| have shown that no element in C'^ contains a point which lies above 
^^^2fc-i(^^) ^ ^- Since all sets C'^ are clipped to B, for each B G B, it follows that all portions of 
the various arrangements that we construct, over all B G B, lie within lev^2fc_i(-^)- Hence, 

^ OB < \L\k < \L\k. 



□ 



We thus conclude: 

Corollary 3.10. The total expected running time for the ith iteration is 

\u 



O ( na{n) + \Li\ [k + log ^ 



Proof. This follows by substituting the bounds from Claims 3.7 3.9 into M|, by using that log^ k 



0{k), and by remembering that we set L = Li., S = and p = |5|/|-L|. □ 
Note that 

log* n— log* fc+l 

\Li\k = nk / log^^^ n <^ nk, 



5* n— log* fc+l log* n— log* fc+1 

E 

i=2 1=2 
I 2 



since the sequence {l/ log*-*^ n} .^j^^* ^ decreases faster than any geometric sequence. More- 
over, for all but the last iteration, we have |Li| log(|Lj|/|Lj_i|) <^ |Lj|log^*^n <C n. At the last 
iteration, we have IS*! > n/k, so logdLl/IS"!) < log A;, and thus 

log* n— log* fc+1 

\Li\ log^*-* n + \L\ logk <^ n(log* n — log* k + log A;). 

1=2 

It thus follows that the overall expected running time is 0(na(n) (log* n — log* k) + nk). It is easy 
to verify that when k = we obtain the same asymptotic time bound as in Theorem |2.2[ 



A faster algorithm under the obliviousness assumption. Similar to Section 2.2 the expected 
running time can be improved to 0{n{a{n)+k)) assuming obliviousness. As before, we now compute 
a gradation during the preprocessing phase: Pi ^ P2 ^ • • • ^ -Pi+ioglogn = P with |i-*i| <C n/logn 
and |-Pj| = 2|Pj_i|, and we compute each of the arrangements A{P*) in the dual plane. 

The algorithm for processing a set of lines L, with each line containing exactly one point, is 
just as above, with two major differences: first, we compute the gradation for L by using the 
precomputed gradation for P. Second, during the ith iteration we use A{P*) instead of A{P*) to 



determine the zone of 7. Using similar considerations as in Section [2. 2[ the bound in Corollary 3.10 
now becomes 0{\Li\{a{n) + k)), because log(|Lj|/|Lj_i|) = 1. Summing over the various iterations 
i and the final step yields the bound 0{n{a{n) + k)), as asserted. The total storage requirement 
remains O(n^). 



This at last concludes the proof of Theorem 3.2 
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4 Lower Bounds 



In this section we study problems where preprocessing A{L) is unhkely to decrease the query time 
to o(n log n) (at least under some computational models). 

Delaunay triangulations. It has already been observed in [8 ,30 that for some sets L, even when 



we have A{L) precomputed, there are point sets, with each point lying on a distinct line, such that 
their Delaunay triangulation cannot be constructed in o(n log n) time (albeit sometimes one can 



obtain better bounds if each point lies on a fat region given in advance [8 ,|34 ). This lower bound 
holds in the classic algebraic computation tree model [3| Chapter 16], and it essentially comes from 
a construction due to Djidjev and Lingas [22]. Specifically, they showed that when the points are 
sorted in just a single direction, one cannot compute their Delaunay triangulation in less than 
0(n log n) time. Thus, if L is a set of vertical lines, we can only anticipate the j;-order of the points 
(received later), from which the lower bound follows. Note that this lower bound also implies that 
no speedup is possible for computing the Euclidean minimum spanning tree (EMST), since the 
Delaunay triangulation can be constructed in linear time once the EMST is known [14||29| . 
Closest Pairs. Finding the closest pair in a point set is somewhat easier than the Delaunay 
triangulation problem (since the latter has an edge between the closest pair [g] ) , but is often harder 
than computing convex hulls (except perhaps when the model of computation provides the floor 
function as well as a source of randomness, see, e.g., [27]). Formally, the problem is defined as 
follows: given a set L = {ii, . . . of lines in the plane, compute a data structure such that 
given any point set P = {pi, . . . ,pn} with S for i = 1, . . . , n, we can quickly find a pair 
(PiiPj) G -P X P of distinct points that minimizes \\pi — Pj\\- Incorporating the lower bound by 
Djidjev and Lingas ^22j, we show the following: 

Proposition 4.1. There exists a set L = {ii, . . . , in} of lines in the plane, such that for any point 
set P with each point lying on a distinct line of L, finding the closest pair in P (after preprocessing 
L) requires J7(nlogn) operations under the algebraic computation tree model. 

Proof. Consider the problem Fuzzy-2-Separation: for a sequence xi, . . . ,x„ in M, output No, if 
there exists a pair 1 < i < j < n with \xi — Xj\ < 1, and Yes, if for each pair 1 < i < j < n we 
have |xj — > 2. In all other cases the answer is arbitrary. 

Claim 4.2. Any algebraic decision tree for the problem Fuzzy-2-Separation has depth Q{n log n) . 

Proof. This follows from a straightforward application of the technique of Ben-Or [s]. The only 
somewhat non-standard feature is the need to deal with fuzziness. Let 



Wi = {(xi, . . . ,x„) G M" I VI < i < j < n : |xj - Xjl > 1}, 

and let 

W2 = {{xi,...,Xn) £ R"' \ y 1 < i < j < n : \xi - Xj\ > 2}. 

Let T be a decision tree for Fuzzy-2-Separation, and let W = r~^(YES) C M" be the 
set of inputs that lead to a leaf in T labeled Yes. By definition, we have Wi 1^ W D W2- It 
now follows that W has at least n! different connected components, since the n! inputs Xj^ = 
(27r(l), 27r(2), . . . , 27r(n)) for any permutation vr of {1, . . . ,n} are all contained in W2 and reside in 
different connected components of Vl^i (see [sj Theorem 16.20] for this standard technique). Hence, 
Ben-Or's result [5j implies that T has depth 0(logn!) = i7(nlogn). □ 
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The reduction from Fuzzy-2-Separation to closest pair queries is almost straightforward. For 
i = 1, ... ,n, let ii be the horizontal line ii : y = i/n, and let L = {£i, . . . Thus, the only 

information we can precompute from L is exactly this order. Given an instance {xi, . . . ,Xn) of 
Fuzzy-2-Separation, we map each Xi to a point pi = {xi,i/n) £ ii, and then find the closest 
pair in the resulting point set. If the distance of the closest pair is greater than 2, our algorithm 
outputs Yes, otherwise it outputs No. Clearly, the overhead for this reduction is linear. We are 
now left to show the correctness of the reduction. Indeed, if \xi — Xj\ > 2, for every pair of indices 
I- ^ i < j ^ n, then clearly \\pi — Pj\\ > + l/n^ > 2, and this in particular applies for the 
closest pair of points. Otherwise, if there exists a pair 1 < i < j < n with \xi — Xj\ < 1, then 
\\Pi — PjW < Vl + 1 < 2 (and this also upper bounds the distance between the closest pair), so the 
reduction reports the correct answer on all mandatory Yes and No instances, as asserted. □ 

Convex hull in three dimensions. Returning to the convex hull problem, we next study its 
extension to three dimensions. That is, given a set H = {hi, h2, . . . , hn} of n planes in M^, we 
would like to compute a data structure, so that for any point set P = {pi, . . . ,pn} with pi G hi, 
i = 1, . . . ,n, we can construct CH(P) quickly. Since the complexity of the convex hull in both 
and is only linear, and since there are several algorithms that construct the convex hull 
(in both cases) in the same asymptotic running time (see, e.g., [6|[T6]), one may ask if a three- 
dimensional convex hull query can be answered in o(n log n) time as well. Using the well-known 
lifting transformation [41 1, one can quickly derive a lower bound from the result about Delaunay 
triangulations mentioned above, but below we also give simple direct reduction (which follows 
immediately from a result of Seidel [39|). 

Proposition 4.3. There is a set H = {hi, . . . , hn} of planes in M^, such that for any point set P 
with each point lying on a distinct plane of H , constructing CII(P) (after preprocessing H) requires 
r2(nlogn) operations under the algebraic computation tree model. 

Proof. Let hi be the plane defined by the equation z = i, for i = 1, . . . ,n, and let H = {hi, . . . , hn}. 
We give a reduction from planar convex hulls to computing three-dimensional convex hulls of point 
sets, where each plane in H contains precisely one such point. Let P = {pi, . . . ,pn} be a set of 
points in the plane, and, for i = 1, . . .n, let pi := (pixiPiy^i), that is, the point obtained by lifting 
Pi to hi. As observed by Seidel [39| Section IV], to compute the planar convex hull CII(P), it 
suffices to perform a convex hull query for P = {pi,P2, ■ ■ ■ ,Pn} and then project the result onto 



the xy-plane. It is shown in 39 that once we have CH(P) at hand, the time to project it onto the 
j;y-plane (and then extract the actual planar convex hull) is only 0{n). Thus the overhead of the 
reduction is linear, as is easily verified. The result now follows from the standard r2(n log n) lower 
bound for planar convex hulls in the algebraic computation tree model (see, e.g., [5]). □ 

Sorting. Interestingly, a similar approach also shows that sorting requires f2(nlogn) operations 
under the algebraic computation tree model. We have a set L = {£i, . . . , in} of n lines in the plane, 
and we wish to compute a data structure such that for any set P = {pi, . . . ,pn} of points with 
Pi £ ii, i = 1, . . . ,n, we can quickly sort these points according to their x-order. 

Proposition 4.4. There exists a set L = {ii, . . . ,in} of lines in the plane, such that for any 
point set P with each point lying on a distinct line of L, sorting P according to its x-order (after 
preprocessing L) requires J7(nlogn) operations under the algebraic computation tree model. 
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Proof. Let X = {xi, . . . C R. For i = 1, . . . ,n, let it be the line li : y = i, and let L = 
{ii, . . . , in}- We now lift each Xi on ^j, and obtain the point pi := {xi, i), i = 1, . . . ,n] let P denote 
this set of points. It is now easy to see that the x-order of P yields the sorted order for the numbers 
in X, and that this reduction has a linear running time. □ 



5 Concluding remarks 



Note that Proposition 4.4 which has a straightforward proof, has an intriguing implication em- 
phasizing a main contribution of this paper: while the "standard" planar convex hull and sorting 
problems are basically equivalent in terms of hardness (e.g., |6]), in our setting convex hull queries 
are in fact easier. This improvement stems from the "output-sensitive nature" of convex hulls: 
points inside the hull are irrelevant to the computation, and the information provided by L, com- 
bined with our update technique, allows us to quickly discard those non-extremal points, and not 
further process them in following iterations. In our setting the two problems become equivalent if 



the input points are in convex position. Then, Proposition 4.4 does not apply, since the points are 
sorted along two directions, and having the order according to one of them immediately implies 
the order according to the other. 

Our study raises several open problems. The first one is whether the log* n factor in the query 
time bound is indeed necessary for both convex hull and (< /c)-level queries. We conjecture it to be 
an artifact of the technique and that the actual running times are 0{na{n)) and 0{n{a{n) + A;)) 
for the two respective problems (as in the obliviousness model). Another problem concerns the 
case of convex hulls for points restricted to three-dimensional lines. In this case, the lower bound 
in Section |4] does not apply. Moreover, if the lines are parallel, a simple variant of our approach 
yields expected query time 0(n log log n) with polynomial preprocessing and storage. Is there a 
better bound? What happens in the general case? 
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A Levels in Arrangements 



Proof of Lemma 3.6: We actually consider the sum over all A ^ g with ca > 2(3' k/p, where 
/3' > 1 is a constant to be fixed shortly. Then the lemma follows by choosing /3 > 2/3'. In what 
follows, with a slight abuse of notation, we denote (3' by f3. For every vertex v of A{L), let Cy be 
the set of lines intersecting the (open) downward vertical ray emanating from v, and put := \Cv\. 
Every vertex of lev^(S') bounds at most two cells in 7^ ^, and for every A G ^ any line in Ca 
passes under at least one vertex of A. Thus, we have ca < 2 max{c„^ , Ctjj}, where vi, V2 are the 
two vertices of A. We thus have: 

^ CA(logCA + log^fc + log(l/p)) < ^ c^(logc^ + log^A; + log(l/p)). 



CA>2/3fc/p2 



Cv>l3k/p'^ 



Now, let f be a vertex of A{L) at level > {k/p^) — 1, and let ^1,^2 be the two lines defining v. 
The vertex v appears in Vj^{S) precisely if (i) li and I2 are in S; and (ii) S contains k — loik lines 
below V. Thus, 

Vt[v G Vr{S)] = Pr[{£i,£2} C 5 A ISnai G - l,k}] 



\L\ 
p\L\ 



L\ 
p\L\ 



■ Pr 



\SnCy\ e{k-i,k}\ {ii,i2} ^ S 



Conditioned on S containing {^1,^2}, the sample S' := S \ {^1,^2} is a random {p\L\ — 2)-sample 
from the set L' := L\{ii,£2}. Hence, |S"nC^| follows a hypergeometric distribution, so Hoeffding's 
bound (15,26 implies that 



Pr[|5na| £{k-l,k}\ {4,^2} ^S]< Pr[|S'nC^,|/|5'| < k/\S'\ 



Pi[\S' nCy\/\S'\ > l-k/\S'\] < 



cJ\L'\V-'/\''WcJ\L'\ 



k/\S'\' 



\S'\ 



l-k/\S'\J \k/\S'\ 
recalling that c„ = |C„| denotes the number of lines below v. Now note that 

p\L'\ = p{\L\ -2) = \S\-2p= \S'\ + 2-2p. 
Thus, writing = t ■ k/p, for some appropriate t > j3/p, we get 

Pr[|5na| G {fc-l,fc} I {h,l2} C S] 

\ l-tk/{\S'\+2{l-p)) \i-k/\s'\ nk/{\S'\+2{l-p)) \k/\s'\ 
V l-kl\S'\ / " V kl\S'\ / 



\s'\ 



< 
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To simplify this, we first observe that 



tk 



< 1 



tk 



3tk 



\S'\+2{l-p)- \S'\+2- \S'\ + \S'\/3 4|5'|' 

since we can assume S' is large enough so that 2 < |5"|/3. Therefore, 

I -tk/{\S'\+ 2(1 -p)) ^ l-i3t/4)k/\S'\ i3t/A-l)k/\S'\ 
l-k/\S'\ ~ l-k/\S'\ ^-k/\S'\ 

For the other term, we calculate 

tk/{\S'\+2{l-p)) _^ \S'\ ^ ^ 

~k/\S'\ \S'\+2{l-p) - ■ 

Therefore, we can bound the probability as 

[I - ^'t-Y/isf ' j 

< exp (-(3t/4 - 1 - logt)k^ < exp (^-tk/2] , 
for t > f3/p large enough. We next observe that 



\s'\ 



\L\-2\^^\L\ 



,p\L\-2j'\p\L\J {p\L\ - 2)li\L\ - p\L\)\ ' 
in order to conclude that 



(|L|-2)! (p|L|)!(|L|-p|L|)! p\L\ p\L\ - 1 



|L|! 



\L\ \L\-1 



< p 



\snc^\ e{k-i,k}\ {£i,e2} ^ s 

< p^ ■ exp ( -tk/2 

Now we can finally bound the expectation as follows: 



Exp 



^ c„(logc„ + log^fc + log(l/p)) 

.Cv>l3k/p^ 



^ Fi[v G Vj^iS)] c,(logc„ + log2 k + log 
(grouping by level, using Q, and letting Ic denote the number of vertices in leVc(-L)) 



\L\ 

E 



< Ic p'^ e'^P^^ c{logc + log^k + log {I /p)) 
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(bounding the sum by an integral and using Ic = 0{\L\c^/^) [21]) 

/•oo 

< / |L|c^/Ve"^^^^c(logc + log2^ + log(l/p))dc 

(substituting c = tk/p and using dc = {k/p)dt) 

|L|(tfc/p)i/Ve"*^/^ {tk/p){logit/p) + log2 k + log {l/p))Ck/p) dt 



it=p/p 

(collecting the terms and simplifying) 

< \L\{p/p) / fe'^^/'^dt 

Jt=l3/p 

(solving the integral) 

= |L|(P/p)(2 • iP/pfk-^ + 8(/3/p)fe-2 + l6fc-3)e-^^/2p 

(simplifying) 

for /3 large enough, as desired. □ 
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